Location of Processor Allocator and Job Scheduler and Its Impact on CMP Performance
نویسندگان
چکیده
High Performance Computing (HPC) architectures are being developed continually with an aim of achieving exascale capability by 2020. Processors that are being developed and used as nodes in HPC systems are Chip Multiprocessors (CMPs) with a number of cores. In this paper, we continue our effort towards a better processor allocation process. The Processor Allocator (PA) and Job Scheduler (JS) proposed and implemented in our previous works are explored in the context of its best location on the chip. We propose a system, where all locations on a chip can be analyzed, considering energy used by Network-on-Chip (NoC), PA and JS, and processing elements. We present energy models for the researched CMP components, mathematical model of the system, and experimentation system. Based on experimental results, proper placement of PA and JS on a chip can provide up to 45% NoC energy savings. Keywords—CMP, PA, JS, energy, assignment.
منابع مشابه
Memory Utilization of Processor Allocator for NoC-based Chip Multiprocessors with Mesh Topology
Chip MultiProcessors (CMPs) have become the primary method of build high-performance microprocessors. Besides speed, major elements such as processing elements and network on chip, allocation and management of on-chip processors are also important factor to achieve high efficiency of future CMPs. In this paper, the authors study a Processor Allocator (PA), especially the issue of its memory uti...
متن کاملChip Multi Processing aware Linux Kernel Scheduler
Recent advances in semiconductor manufacturing and engineering technologies have led to the inclusion of more than one CPU core in a single physical processor package. This, popularly known as Chip Multi Processing (CMP), allows multiple instruction streams to execute at the same time. CMP is in addition to today’s Simultaneous Multi Threading (SMT) capabilities, like Intel R © Hyper-Threading ...
متن کاملStudy of the Effects of Program Inputs on Co-Scheduling
Cache sharing in Chip Multiprocessors (CMPs) helps hide the latency of inter-thread communications and improve effective memory bandwidth, but also brings cache contention issues, which often cause cache thrashing and drastic performance degradation. Recent studies show the effectiveness of job co-scheduling in alleviating cache contention, but without exploring the impact of program inputs on ...
متن کاملImprove WiMAX Network Performance Using Cross-Layer Framework
WiMAX OFDMA downlink subframes have a special 2-D channel time structure. The resource allocations from this structure incur extra control overheads that hurt network performance. Existing solutions try to improve network performance by designing either the scheduler in the MAC layer or the burst allocator in the physical layer, but the efficiency of overhead reduction is limited. In this paper...
متن کاملA Minimal Fragmentation Algorithm for Task Allocation in Mesh - Connected Multicomputers
—Efficient allocation of processors to incoming tasks in tightly coupled systems is crucial for achieving high performance. A good allocation algorithm should identify available processors with minimum overhead. In addition, it should be submesh recognition complete and should minimize fragmentation as far as possible. In this paper, we propose an efficient task allocation mechanism called the ...
متن کامل